In the mathematical theory of

random processes In probability theory and related fields, a stochastic () or random process is a mathematical object usually defined as a family of random variables. Stochastic processes are widely used as mathematical models of systems and phenomena that appea ...

, the Markov chain central limit theorem has a conclusion somewhat similar in form to that of the classic

central limit theorem In probability theory, the central limit theorem (CLT) establishes that, in many situations, when independent random variables are summed up, their properly normalized sum tends toward a normal distribution even if the original variables themselv ...

(CLT) of probability theory, but the quantity in the role taken by the variance in the classic CLT has a more complicated definition. See also the general form of Bienaymé's identity.

Statement

Suppose that: * the sequence

X_1,X_2,X_3,\ldots

random element In probability theory, random element is a generalization of the concept of random variable to more complicated spaces than the simple real line. The concept was introduced by who commented that the “development of probability theory and expansio ...

s of some set is a

Markov chain A Markov chain or Markov process is a stochastic model describing a sequence of possible events in which the probability of each event depends only on the state attained in the previous event. Informally, this may be thought of as, "What happe ...

that has a

stationary probability distribution Stationary distribution may refer to: * A special distribution for a Markov chain such that if the chain starts with its stationary distribution, the marginal distribution of all states at any time will always be the stationary distribution. Assum ...

; and * the initial distribution of the process, i.e. the distribution of

X_1

, is the stationary distribution, so that

X_1,X_2,X_3,\ldots

are identically distributed. In the classic central limit theorem these random variables would be assumed to be

independent Independent or Independents may refer to: Arts, entertainment, and media Artist groups * Independents (artist group), a group of modernist painters based in the New Hope, Pennsylvania, area of the United States during the early 1930s * Independ ...

, but here we have only the weaker assumption that the process has the

Markov property In probability theory and statistics, the term Markov property refers to the memoryless property of a stochastic process. It is named after the Russian mathematician Andrey Markov. The term strong Markov property is similar to the Markov propert ...

; and *

g

is some (measurable) real-valued function for which

\operatorname(g(X_1)) <+\infty.

Now let :

\begin
\mu & = \operatorname E(g(X_1)), \\
\widehat\mu_n & = \frac 1 n \sum_^n g(X_k)\\
\sigma^2 & := \lim_ \operatorname(\sqrt\widehat\mu_n) = \lim_ n \operatorname(\widehat\mu_n) = \operatorname(g(X_1)) + 2\sum_^\infty \operatorname( g(X_1), g(X_)).
\end

Then as

n \to\infty,

we haveGeyer, Charles J. (2011). Introduction to Markov Chain Monte Carlo. In ''Handbook of MarkovChain Monte Carlo''. Edited by S. P. Brooks, A. E. Gelman, G. L. Jones, and X. L. Meng. Chapman & Hall/CRC, Boca Raton, FL, Section 1.8. http://www.mcmchandbook.net/HandbookChapter1.pdf :

\sqrt (\hat_n - \mu) \ \xrightarrow \ \text(0, \sigma^2),

where the decorated arrow indicates

convergence in distribution In probability theory, there exist several different notions of convergence of random variables. The convergence of sequences of random variables to some limit random variable is an important concept in probability theory, and its applications to ...

. This means that :

\hat\mu_n \sim \operatorname\left( \mu, \frac n \right),

where ~ means "is distributed like".

Monte Carlo Setting

The Markov chain central limit theorem can be guaranteed for functionals of general state space Markov chains under certain conditions. In particular, this can be done with a focus on Monte Carlo settings. An example of the application in a MCMC (Markov Chain Monte Carlo) setting is the following: Consider a simple

hard spheres Hard spheres are widely used as model particles in the statistical mechanical theory of fluids and solids. They are defined simply as impenetrable spheres that cannot overlap in space. They mimic the extremely strong ("infinitely elastic bouncing" ...

model on a grid. Suppose

X = \ \times \ \subseteq Z^2

. A proper configuration on

X

consists of coloring each point either black or white in such a way that no two adjacent points are white. Let

\chi

denote the set of all proper configurations on

X

N_(n_1, n_2)

be the total number of proper configurations and π be the uniform distribution on

\chi

so that each proper configuration is equally likely. Suppose our goal is to calculate the typical number of white points in a proper configuration; that is, if

W(x)

is the number of white points in

x \in \chi

then we want the value of

E_W=\sum_\frac

n_1

and

n_2

are even moderately large then we will have to resort to an approximation to

E_W

. Consider the following Markov chain on

\chi

. Fix

p \in (0, 1)

and set

X_1 = x_1

where

x_1 \in \chi

is an arbitrary proper configuration. Randomly choose a point

(x, y) \in X

and independently draw

U \sim \mathrm(0, 1)

. If

u \le p

and all of the adjacent points are black then color

(x, y)

white leaving all other points alone. Otherwise, color

(x, y)

black and leave all other points alone. Call the resulting configuration

X_1

. Continuing in this fashion yields a Harris ergodic Markov chain

\

having

\pi

as its invariant distribution. It is now a simple matter to estimate

E_ W

with

\overline=\sum_^ W(X_i)/n

. Also, since

\chi

is finite (albeit potentially large) it is well known that

X

will converge exponentially fast to

\pi

which implies that a CLT holds for

\overline

Implications

Not taking into account the additional terms in the variance which stem from correlations (e.g. serial correlations in markov chain monte carlo simulations) can result in the problem of

pseudoreplication Pseudoreplication (sometimes unit of analysis error) has many definitions. Pseudoreplication was originally defined in 1984 by Stuart H. Hurlbert as the use of inferential statistics to test for treatment effects with data from experiments where ...

when computing e.g. the

confidence interval In frequentist statistics, a confidence interval (CI) is a range of estimates for an unknown parameter. A confidence interval is computed at a designated ''confidence level''; the 95% confidence level is most common, but other levels, such as 9 ...

s for the

sample mean The sample mean (or "empirical mean") and the sample covariance are statistics computed from a Sample (statistics), sample of data on one or more random variables. The sample mean is the average value (or mean, mean value) of a sample (statistic ...

References

{{reflist

Sources

* Gordin, M. I. and Lifšic, B. A. (1978). "Central limit theorem for stationary Markov processes." ''Soviet Mathematics, Doklady'', 19, 392–394. (English translation of Russian original). * Geyer, Charles J. (2011). "Introduction to MCMC." In ''Handbook of Markov Chain Monte Carlo'', edited by S. P. Brooks, A. E. Gelman, G. L. Jones, and X. L. Meng. Chapman & Hall/CRC, Boca Raton, pp. 3–48. Markov processes Markov models Stochastic processes Stochastic models Probability theorems Asymptotic theory (statistics) Normal distribution